Integrating Triangle and Jaccard similarities for recommendation

نویسندگان

  • Shuang-Bo Sun
  • Zhi-Heng Zhang
  • Xin-Ling Dong
  • Heng-Ru Zhang
  • Tong-Jun Li
  • Lin Zhang
  • Fan Min
چکیده

This paper proposes a new measure for recommendation through integrating Triangle and Jaccard similarities. The Triangle similarity considers both the length and the angle of rating vectors between them, while the Jaccard similarity considers non co-rating users. We compare the new similarity measure with eight state-of-the-art ones on four popular datasets under the leave-one-out scenario. Results show that the new measure outperforms all the counterparts in terms of the mean absolute error and the root mean square error.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A note on the triangle inequality for the Jaccard distance

Two simple proofs of the triangle inequality for the Jaccard distance in terms of nonnegative, monotone, submodular functions are given and discussed. The Jaccard index [8] is a classical similarity measure on sets with a lot of practical applications in information retrieval, data mining, machine learning, and many more (cf., e.g., [7]). Measuring the relative size of the overlap of two finite...

متن کامل

Improving Neighborhood-Based Collaborative Filtering by a Heuristic Approach and an Adjusted Similarity Measure

“Collaborative filtering” is the most used approach in recommendation systems since it provides good predictions. However, it still suffers from many drawbacks such as sparsity and scalability problems especially for huge datasets which consist of a large number of users and items. This paper presents a new algorithm for neighborhood selection based on two heuristic approaches. The first of whi...

متن کامل

Investigating the Factors Affecting the Whistle-blowing Intentions Using the Fraud Triangle and the Theory of Planned Behavior: An Accountant's Perspective

Nowadays, whistle-blowing is an important mechanism for detecting and preventing fraud. The purpose of the present research is to investigate factors that influence accountants’ intentions of whistle-blowing by integrating the fraud triangle components, as the external environmental factors that determine whistleblowing intention, and the extended theory of planned behavior, as the individual i...

متن کامل

Automatic Hashtag Recommendation in Social Networking and Microblogging Platforms Using a Knowledge-Intensive Content-based Approach

In social networking/microblogging environments, #tag is often used for categorizing messages and marking their key points. Also, since some social networks such as twitter apply restrictions on the number of characters in messages, #tags can serve as a useful tool for helping users express their messages. In this paper, a new knowledge-intensive content-based #tag recommendation system is intr...

متن کامل

ارائه یک الگوریتم خوشه بندی برای داده های دسته ای با ترکیب معیارها

Clustering is one of the main techniques in data mining. Clustering is a process that classifies data set into groups. In clustering, the data in a cluster are the closest to each other and the data in two different clusters have the most difference. Clustering algorithms are divided into two categories according to the type of data: Clustering algorithms for numerical data and clustering algor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2017